A transparent and transportable methodology for evaluating Data Linkage software

نویسندگان

  • Anna M. Ferrante
  • James H. Boyd
چکیده

There has been substantial growth in Data Linkage (DL) activities in recent years. This reflects growth in both the demand for, and the supply of, linked or linkable data. Increased utilisation of DL "services" has brought with it increased need for impartial information about the suitability and performance capabilities of DL software programs and packages. Although evaluations of DL software exist; most have been restricted to the comparison of two or three packages. Evaluations of a large number of packages are rare because of the time and resource burden placed on the evaluators and the need for a suitable "gold standard" evaluation dataset. In this paper we present an evaluation methodology that overcomes a number of these difficulties. Our approach involves the generation and use of representative synthetic data; the execution of a series of linkages using a pre-defined linkage strategy; and the use of standard linkage quality metrics to assess performance. The methodology is both transparent and transportable, producing genuinely comparable results. The methodology was used by the Centre for Data Linkage (CDL) at Curtin University in an evaluation of ten DL software packages. It is also being used to evaluate larger linkage systems (not just packages). The methodology provides a unique opportunity to benchmark the quality of linkages in different operational environments.

منابع مشابه

A framework for estimating the applicability of GAs for real-world optimization problems

Genetic Algorithms (GAs) have been gradually identified as an optimization-problem solver for certain classes of real-world applications. As GAs are increasingly utilized, a foundational study on how well GAs can perform with respect to varying problem domains becomes crucial. Yet, none of the prevalent theoretical studies are built upon the linkage between the theory and application of GAs. Th...

متن کامل

The Relationship Between Non-Transparent Financial Reporting and Risk Stock Futures Fall Due to the Size and Performance

The purpose of this study was to investigate the relationship between stock futures fall risk with non-transparent financial reporting at three levels of size, efficiency and return on equity, in the period 2010 to 2014 was in Tehran Stock Exchange. The population of the study are all companies listed in Tehran Stock Exchange. Data collected and calculated by using Excel software Eviews 7 been ...

متن کامل

Regional Efficiency, Innovation and Productivity

A blossoming stream of the regional innovation systems (RIS) literature is being devoted to investigate the relationship between RIS efficiency and productivity growth. Our study aims at evaluating: first, the ex-post relative technical efficiency in innovation in a sample of OECD regions by means of a DEA (data envelopment analysis) methodology. We will also match these results with regression...

متن کامل

Agent Tcl: A transportable agent system

Agent Tcl is a transportable-agent system that is under development at Dartmouth College. A transportable agent is a named program that can migrate from machine to machine in a heterogeneous network. Such programs are a powerful tool for implementing information agents since the electronic resources in a user's information space are often distributed across a network and can contain tremendous ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:
  • Journal of biomedical informatics

دوره 45 1  شماره 

صفحات  -

تاریخ انتشار 2012